منابع مشابه
Efficient RSS Feed Generation from html Pages
Although RSS demonstrates a promising solution to track and personalize the flow of new Web information, many of the current Web sites are not yet enabled with RSS feeds. The availability of convenient approaches to “RSSify” existing suitable Web contents has become a stringent necessity. This paper presents EHTML2RSS, an efficient system that translates semi-structured HTML pages to structured...
متن کاملData-rich Section Extraction from HTML pages
The paper is about a novel algorithm, DSE (Datarich Subtree Extraction) to recognize and extract the datarich section of an HTML page. The DSE algorithm is used for two typical web information retrieval problems: topic distillation and web information extraction. The DSE algorithm has been developed by Jiying Wang from the University of Science & Technology in Hong Kong. Introduction Many Inter...
متن کاملHiding Inside HTML and Other Source Codes
Many steganographic techniques [1] [2] [3] [4] were proposed for hiding secret message inside images, the simplest of them being the LSB data hiding [6] [7] [8] [9] [10], [11]. In this paper, we suggest a novel data hiding technique in an Html Web page [12] and also propose some simple techniques to extend the embedding technique to source codes written in any programming language (both case in...
متن کاملMining Tables from Large Scale HTML Texts
Table is a very common presentation scheme, but few papers touch on table extraction in text data mining. This paper focuses on mining tables from large-scale HTML texts. Table filtering, recognition, interpretation, and presentation are discussed. Heuristic rules and cell similarities are employed to identify tables. The F-measure of table recognition is 86.50%. We also propose an algorithm to...
متن کاملWeb-scale profiling of semantic annotations in HTML pages
The vision of the Semantic Web was coined by Tim Berners-Lee almost two decades ago. The idea describes an extension of the existing Web in which “information is given well-defined meaning, better enabling computers and people to work in cooperation” [Berners-Lee et al., 2001]. Semantic annotations in HTML pages are one realization of this vision which was adopted by large numbers of web sites ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: JOURNAL OF EDUCATION AND SCIENCE
سال: 2013
ISSN: 2664-2530
DOI: 10.33899/edusj.2013.89900